
Overview of Datasets

The Dataset section lets you explore, manage, and use datasets for training, testing, and fine-tuning AI models. It provides a curated collection of datasets that can be filtered by format, such as CSV, JSON, Text, and JSONL.

  • Dataset Filtering: Easily filter datasets by format, including CSV, JSON, Text, and JSONL.
  • Top-Rated Datasets: The most popular datasets are highlighted so you can reach them quickly.
  • Search Functionality: Use the search bar to quickly find datasets by name or keyword.

How to Navigate the Dataset Overview:

  1. Browse Available Datasets: Explore the list of available datasets and use filters to narrow the selection by format.
  2. Search for Specific Datasets: Enter keywords in the search bar to find a dataset based on your needs.
  3. View Detailed Dataset Information: Click on any dataset to access its full details, including its description, format, and size.
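
The filtering and search steps above can be pictured with a small sketch. It models the dataset list as plain in-memory records and is illustrative only: `DatasetEntry`, its fields, and `filter_datasets` are hypothetical names, not part of the platform, and the platform's actual matching rules (for example, whether search also covers descriptions) are not documented here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DatasetEntry:
    # Hypothetical record mirroring the details shown in the list view.
    name: str
    fmt: str          # e.g. "CSV", "JSON", "Text", "JSONL"
    size_bytes: int


def filter_datasets(entries, fmt: Optional[str] = None, query: str = ""):
    """Keep entries of the given format (if any) whose name contains the query.

    Both comparisons are case-insensitive; an empty query matches everything.
    """
    wanted = fmt.lower() if fmt else None
    needle = query.strip().lower()
    return [
        entry for entry in entries
        if (wanted is None or entry.fmt.lower() == wanted)
        and needle in entry.name.lower()
    ]


# Made-up sample entries, for illustration only.
catalog = [
    DatasetEntry("support-tickets", "CSV", 2_400_000),
    DatasetEntry("chat-transcripts", "JSONL", 18_000_000),
    DatasetEntry("product-reviews", "JSON", 5_100_000),
    DatasetEntry("ticket-labels", "CSV", 90_000),
]

csv_ticket_sets = filter_datasets(catalog, fmt="csv", query="ticket")
```

Here `csv_ticket_sets` holds the two CSV entries whose names contain "ticket", which corresponds to applying the CSV filter and typing "ticket" into the search bar.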

Adding New Datasets:

There are two main ways to add a new dataset:

  1. Integration: Choose Integration to directly import datasets from integrated platforms or data sources. This ensures seamless data flow between your dataset sources and AI models.
  2. Database Import: Select Database to import datasets from a structured database. This option suits relational databases and other structured data sources.

Key Advantages:

  • Flexible Dataset Types: Users can add various dataset types (CSV, JSON, etc.), which are crucial for training and testing AI models across different use cases.
  • Top-Rated Datasets: Access to the most liked and reliable datasets helps ensure that you work with high-quality data for model optimization.

Tip: To achieve the best results when training or fine-tuning models, ensure that the dataset is clean, well-structured, and properly formatted. Always review dataset previews before uploading.
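
One way to check structure before uploading is a short local script. The following is a minimal sketch using only the Python standard library; `check_csv` and `check_jsonl` are hypothetical helper names, and the rules they apply (consistent field counts, one JSON object per line) are assumptions about what "well-structured" means, not requirements stated by the platform.

```python
import csv
import io
import json


def check_csv(text: str) -> list[str]:
    """Return a list of problems found in CSV text (empty list: none found)."""
    rows = list(csv.reader(io.StringIO(text)))
    if not rows:
        return ["file is empty"]
    problems = []
    header = rows[0]
    if any(not name.strip() for name in header):
        problems.append("header contains an empty column name")
    if len(set(header)) != len(header):
        problems.append("header contains duplicate column names")
    # Record 1 is the header, so data records are numbered from 2.
    for record_no, row in enumerate(rows[1:], start=2):
        if len(row) != len(header):
            problems.append(
                f"record {record_no}: expected {len(header)} fields, found {len(row)}"
            )
    return problems


def check_jsonl(text: str) -> list[str]:
    """Return a list of problems found in JSONL text (empty list: none found)."""
    problems = []
    for line_no, line in enumerate(text.split("\n"), start=1):
        if not line.strip():
            continue  # blank lines are tolerated in this sketch
        try:
            record = json.loads(line)
        except json.JSONDecodeError as exc:
            problems.append(f"line {line_no}: invalid JSON ({exc.msg})")
            continue
        if not isinstance(record, dict):
            problems.append(f"line {line_no}: expected a JSON object")
    return problems
```

To use it, read the file as text (for example with `pathlib.Path(path).read_text(encoding="utf-8")`) and pass the result to the matching function. An empty list means these particular checks found nothing. It does not guarantee that the data is clean.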


Example Workflow:

  • Selecting a Dataset: Browse through the available datasets and choose one that aligns with your training or testing requirements.
  • Adding New Datasets: Click the New Dataset button, then choose Integration or Database depending on your data source.

The screenshot below shows a filtered dataset list, highlighting essential details such as the dataset's name, format (CSV, JSON, etc.), and its size.